Towards Better Understanding Of Anaphora

نویسنده

  • Barbara Dunin-Keplicz
چکیده

This paper presents a syntactical m e t h o d of interpreting pronouns in Polish, Using the surface structure of the sentence as well as grammatical and inflexional inlormation accessible during syntactic analysis, an area of reference is marked out for each personal and possessive pronoun. This area consists of a few internal areas inside the current sentence and an external area, i.e. the part of the text preceding it. In order to determine that area of reference several syntactic sentence-level restrictions on anaphora interpretation are formulated. Next, when looking at the area of pronoun's reference, all NPs which number-gender agree with the pronoun can be selected and this way the set of surface referents ol each pronoun can be created. It can be used as data for further semantic analysis. I I N T R O D U C T I O N Reference is one of the central concepts of any linguistic theory. In recent research into anaphora the term "reference" has been used in three different senses ( Szwedek, 1981): ( a ) a s a r e l a t i o n b e t w e e n the name a n d the t h i n g n a m e d ( H a l l P a r t e e , 1 9 7 8 ) ( b ) a s a n a s s o c i a t i o n b e t w e e n n o u n p h r a s e s a n d m e n t a l e n t i t i e s in the l a n g u a g e u s e r ' s ( N a s h % ~ e b b e r , 1 9 7 8 ) ( c ) a s a n a s s o c i a t i o n b e t w e e n t h e o c c u r r e n c e of p h r a s e s in t he t e x t ( R e i n h a r t , 1 9 8 1 ) H o w e v e r the r e f e r e n c e i s u n d e r s t o o d , irl o r d e r to i n t e r p r e t c o r r e c t l y a n a p h o r a o n the s e m a n t i c l e v e l ( ( a ) a n d ( b ) ) , f i r s t a s t a g e (C) is necessary. in this paper I have taken the point of vie~ presented under (c). i shall discuss the problem o~ onaphora in Polish ser Atences. rvly altentioF, is focused on personal o n d p o s s e s s i v e p r o n o u n s e x p t i c i t e l y o c c u r r i n g in the text and moreover on zero pronouns, i.e. ellipsis of NP in the subject position, specific for Slavonic languages. My purpose in the description of regularities of the reference in the Polish language. I shall express them by defining the area of pronoun's references, i.e. those regions of the text where its antecedents should De found, q hese surface referents will be selected from among NPs occurring in the sentence. The research on anaphora made for English has led to the formulation of some structural rules using such relations as command, c-command and precede-and-command ( R e i n h a r t , 3.981). I have been searching for analogous rules for Polish. But two essential differences have to be considered: ( i ) grammatical and morphological properties of Polish and English; (ii) different grammatical traditions. For English the rules concernig the coreference of entities were forrrulated on the basis of generative-transformational grammar. For Polish the first precise description of Polish syntax was formulated only recently by Szpakowicz, who based his work on the framework c r e a t e d by S a l o n i (Saloni, 1976; Saloni and Swidzinski, 1981). It is a kind of in,mediate-constituent grammar; the grammatical categories (case, ~ender, etc) are applied not only to single words, but also to compound phrases. In my present ~vork I have limited my attention to the subset of Polish described by Szpakowicz (Szpako~Jvicz, 1983). Folish is a highly inflexionat language and this fact has many and varied consequences. Surface referents of the pronoun will be selected from among those NPs which number-gender agree with the pronoun. Strictly speaking, the grammatical categories of the pronoun should be compatible with the categories of the NP, but in cases of neutralization they cannot be fully determined. My method of determining the areas of pronoun's reference is a syntachc one, because it is based on morphological and syntactical properties of the Polish language. I assume

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Saara Framework

The determination of reference and referential links in discourse is one of the important challenges in natural language understanding. The first commonly adopted step towards this objective is to determine coreference classes over the set of referring expressions. We present a modular framework for automatic anaphora resolution which makes it possible to specify various anaphora resolution alg...

متن کامل

I-34: NRY Haplotype Analysis: towards A Better Understanding of The Genetic Basis of Spermatogenic Failure

It has been established that the Y chromosome carries genes required for spermatogenesis and male fertility. For many decades worldwide screening for gene identification has been conducted in research laboratories. However, it has been a difficult process in identifying such genes (i.e. causative mutations) which could explain the phenotypic variation and could be potentially used as markers fo...

متن کامل

The Impact of Zero Pronominal Anaphora on Translational Language: a Study on Romanian Newspapers

This study investigates the impact of zero pronominal anaphora for Romanian on a learning model able to distinguish between translated and non-translated texts. Even though the correct understanding of ellipsis from the source language and its mapping into the target language is essential in the translation process, zero pronominal anaphora has been scarcely investigated in the context of trans...

متن کامل

Using Anaphora Resolution in a Question Answering System for Machine Reading Evaluation

This paper describes UAIC1’s Question Answering for Machine Reading Evaluation systems participating in the QA4MRE 2013 evaluation task. We submitted two types of runs, both type of runs based on our system from 2012 edition of QA4MRE, and both used anaphora resolution system. Differences come from the fact the textual entailment component was used or not. The results offered by organizer showe...

متن کامل

Probabilistic Resolution of Anaphoric Reference

This paper describes the use of a Bayesian network to resolve anaphora by probabilistically combining linguistic evidence. By adopting a Bayesian approach, we are able to combine diverse evidence in a principled way, extend current understanding of linguistic phenomena by quantifying relationships empirically, and better model the non-deterministic role of linguistic evidence in resolution of a...

متن کامل

ITRI-00-06 Coreference Annotation: Whither?

The terms coreference and anaphora tend to be used inconsistently and interchangeably in much empirically-oriented work in NLP, and this threatens to lead to incoherent analyses of texts and arbitrary loss of information. This paper discusses the role of coreference annotation in Information Extraction, focussing on the coreference scheme defined for the MUC-7 evaluation exercise. We point out ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1983